Topic Tracking for Punjabi Language
Abstract
This paper introduces topic tracking for the Punjabi language. Text mining is a field that automatically extracts previously unknown and useful information from unstructured textual data, and it has strong connections with natural language processing (NLP). NLP has produced technologies that teach computers natural language so that they can analyze, understand, and even generate text. Topic tracking is one such technology and can be used in the text mining process. Its main purpose is to identify and follow events presented in multiple news sources, including newswires, radio, and TV broadcasts. It gathers dispersed information and makes it easy for the user to get a general understanding. Little work has been done on topic tracking for Indian languages in general and Punjabi in particular. We first survey the approaches available for topic tracking, then present our approach for Punjabi, and report experimental results.
Similar resources
Sentiment Analysis on Punjabi News Articles Using SVM
Sentiment analysis is a field of Natural Language Processing and one of the most actively trending fields of research. It is a text mining process used to find out people's opinions about a particular product or topic and to predict market trends or election outcomes by detecting and classifying sentiments in text. Sentiment analysis on Punjabi language is to be performed because of increas...
Punjabi Text Clustering by Sentence Structure Analysis
Punjabi text document clustering is done by analyzing the sentence structure of similar documents that share the same topics and grouping them into clusters. The prevalent algorithms in this field use the vector space model, which treats documents as a bag of words. Meaning in natural language inherently depends on word sequences, which are overlooked and ignored while clustering. The cu...
Automatic Text Summarization System for Punjabi Language
This paper concentrates on a single-document, multi-news Punjabi extractive summarizer. Although a lot of research is under way on multi-document news summarization systems, not a single paper was found in the literature on single-document multi-news summarization for any language. This is the first time such a system has been developed for the Punjabi language, and it is available online at: http...
An Automatic Spontaneous Live Speech Recognition System for Punjabi Language Corpus
In the spontaneous Punjabi speech model, the speech is basically non-planned and non-designed; it is generally characterized by repetitions, preservation, wrong starts, half-spoken words, non-planned words, silence gaps, etc. In a system of Punjabi speech detection including vocabulary, the identification needs the evaluation among the audio signal of the utterance and the variety of utterances of th...
Automatic Spontaneous Speech Recognition for Punjabi Language Interview Speech Corpus
Automatic Speech Recognition offers a natural means of communication between man and machine. The purpose of a speech recognition system is to convert a sequence of sound units into a text description. The main objective of this research is to develop an automatic spontaneous speech model for the Punjabi language. Punjabi is categorized as a constituent of the Indo-Ar...
Publication date: 2011